Improving the performance of GWW A dCSE Project
نویسنده
چکیده
In this report, we present the results of investigation into improving the performance of GWW, part of the Quantum Espresso suite of software for ab initio simulation. In particular, the 3D Fourier Transform was found to be a significant bottleneck to application scaling. Several alternative methods for the FFT transpose were implemented, and the performance of these was studied on HECToR (Phase 2a and 2b). Speedups of up to 400% (on 128 cores of HECToR Phase 2a) were demonstrated for the 3D FFT in isolation, which delivered benefits of in the range of 4-36% in full application benchmarks. A checkpoint and restart mechanism was also added to help jobs complete in under the 12 hour queue limit on HECToR.
منابع مشابه
Improving the scalability of CP2K on multi-core systems A dCSE Project
Six months of HECToR dCSE funding was given to implement mixed-mode OpenMP parallelism in CP2K, building on the results of an earlier successful dCSE project. Improved scalability of up to 8 times as many cores was demonstrated for a small benchmark, and a larger, inhomogeneous benchmark was shown to scale up to 9000+ cores. An increase in peak performance of up to 60% was also realised on HECT...
متن کاملImproving the performance of CP2K on HECToR A dCSE Project
This report presents the results of a HECToR dCSE project to improve the performance of CP2K, a freely available and popular Density Functional Theory code, on HECToR. Building on a recently implemented domain decomposition method, further optimisation of the code was performed, and significant performance gains were measured around 30% on 256 cores (for a generally representative benchmark) an...
متن کاملdCSE Fluidity-ICOM: High Performance Computing Driven Software Development for Next-Generation Modelling of the Worlds Oceans
During the course of this project dCSE Fluidity-ICOM has been transformed from a code that was primarily used on institution level clusters with typically 64 tasks used per simulation into a highly performing scalable code which can be run efficiently on 4096 cores of the current HECToR hardware (Cray XT4 Phase2a). Fluidity-ICOM has been parallelised with MPI and optimised for HECToR alongside ...
متن کاملFinal Performance Report MICROSTRUCTURE AND 3-D EFFECTS IN FRETTING FATIGUE OF TI ALLOYS AND NI-BASE SUPERALLOYS
Richard W. Neu, Ph.D. Project Director and co-Principal Investigator The GWW School of Mechanical Engineering School of Materials Science and Engineering Georgia Institute of Technology Atlanta, GA 30332-0405 404-894-3074 404-894-0186 (fax) [email protected] David L. McDowell, Ph.D. co-Principal Investigator The GWW School of Mechanical Engineering School of Materials Science and Engineeri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010